62 resultados para Complete genome sequencing

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Bipolar mood disorder (BP) is a debilitating syndrome characterized by episodes of mania and depression. We designed a multistage study to detect all major loci predisposing to severe BP (termed BP-I) in two pedigrees drawn from the Central Valley of Costa Rica, where the population is largely descended from a few founders in the 16th–18th centuries. We considered only individuals with BP-I as affected and screened the genome for linkage with 473 microsatellite markers. We used a model for linkage analysis that incorporated a high phenocopy rate and a conservative estimate of penetrance. Our goal in this study was not to establish definitive linkage but rather to detect all regions possibly harboring major genes for BP-I in these pedigrees. To facilitate this aim, we evaluated the degree to which markers that were informative in our data set provided coverage of each genome region; we estimate that at least 94% of the genome has been covered, at a predesignated threshold determined through prior linkage simulation analyses. We report here the results of our genome screen for BP-I loci and indicate several regions that merit further study, including segments in 18q, 18p, and 11p, in which suggestive lod scores were observed for two or more contiguous markers. Isolated lod scores that exceeded our thresholds in one or both families also occurred on chromosomes 1, 2, 3, 4, 5, 7, 13, 15, 16, and 17. Interesting regions highlighted in this genome screen will be followed up using linkage disequilibrium (LD) methods.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial “molecular mimicry” of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genome of the crenarchaeon Sulfolobus solfataricus P2 contains 2,992,245 bp on a single chromosome and encodes 2,977 proteins and many RNAs. One-third of the encoded proteins have no detectable homologs in other sequenced genomes. Moreover, 40% appear to be archaeal-specific, and only 12% and 2.3% are shared exclusively with bacteria and eukarya, respectively. The genome shows a high level of plasticity with 200 diverse insertion sequence elements, many putative nonautonomous mobile elements, and evidence of integrase-mediated insertion events. There are also long clusters of regularly spaced tandem repeats. Different transfer systems are used for the uptake of inorganic and organic solutes, and a wealth of intracellular and extracellular proteases, sugar, and sulfur metabolizing enzymes are encoded, as well as enzymes of the central metabolic pathways and motility proteins. The major metabolic electron carrier is not NADH as in bacteria and eukarya but probably ferredoxin. The essential components required for DNA replication, DNA repair and recombination, the cell cycle, transcriptional initiation and translation, but not DNA folding, show a strong eukaryal character with many archaeal-specific features. The results illustrate major differences between crenarchaea and euryarchaea, especially for their DNA replication mechanism and cell cycle processes and their translational apparatus.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

FULL-malaria is a database for a full-length-enriched cDNA library from the human malaria parasite Plasmodium falciparum (http://133.11.149.55/). Because of its medical importance, this organism is the first target for genome sequencing of a eukaryotic pathogen; the sequences of two of its 14 chromosomes have already been determined. However, for the full exploitation of this rapidly accumulating information, correct identification of the genes and study of their expression are essential. Using the oligo-capping method, we have produced a full-length-enriched cDNA library from erythrocytic stage parasites and performed one-pass reading. The database consists of nucleotide sequences of 2490 random clones that include 390 (16%) known malaria genes according to BLASTN analysis of the nr-nt database in GenBank; these represent 98 genes, and the clones for 48 of these genes contain the complete protein-coding sequence (49%). On the other hand, comparisons with the complete chromosome 2 sequence revealed that 35 of 210 predicted genes are expressed, and in addition led to detection of three new gene candidates that were not previously known. In total, 19 of these 38 clones (50%) were full-length. From these obser­vations, it is expected that the database contains ∼1000 genes, including 500 full-length clones. It should be an invaluable resource for the development of vaccines and novel drugs.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Previously conducted sequence analysis of Arabidopsis thaliana (ecotype Columbia-0) reported an insertion of 270-kb mtDNA into the pericentric region on the short arm of chromosome 2. DNA fiber-based fluorescence in situ hybridization analyses reveal that the mtDNA insert is 618 ± 42 kb, ≈2.3 times greater than that determined by contig assembly and sequencing analysis. Portions of the mitochondrial genome previously believed to be absent were identified within the insert. Sections of the mtDNA are repeated throughout the insert. The cytological data illustrate that DNA contig assembly by using bacterial artificial chromosomes tends to produce a minimal clone path by skipping over duplicated regions, thereby resulting in sequencing errors. We demonstrate that fiber-fluorescence in situ hybridization is a powerful technique to analyze large repetitive regions in the higher eukaryotic genomes and is a valuable complement to ongoing large genome sequencing projects.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Since 1991, the Rice Genome Research Program in Japan has carried out rice genomics, such as large-scale cDNA analysis, construction of a fine-scale restriction fragment length polymorphism map, and physical mapping of the rice genome with yeast artificial chromosome clones. These studies have made a great impact on research into grass genomes and made rice a model plant for other cereal crop research. Starting in 1998, the Rice Genome Research Program will step into a new stage of genomics—that of genome sequencing. This project eventually should reveal all of the genomic sequence information in the rice plant and be an indispensable aid in understanding the genomics of other grass species.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The determination of complete genome sequences provides us with an opportunity to describe and analyze evolution at the comprehensive level of genomes. Here we compare nine genomes with respect to their protein coding genes at two levels: (i) we compare genomes as “bags of genes” and measure the fraction of orthologs shared between genomes and (ii) we quantify correlations between genes with respect to their relative positions in genomes. Distances between the genomes are related to their divergence times, measured as the number of amino acid substitutions per site in a set of 34 orthologous genes that are shared among all the genomes compared. We establish a hierarchy of rates at which genomes have changed during evolution. Protein sequence identity is the most conserved, followed by the complement of genes within the genome. Next is the degree of conservation of the order of genes, whereas gene regulation appears to evolve at the highest rate. Finally, we show that some genomes are more highly organized than others: they show a higher degree of the clustering of genes that have orthologs in other genomes.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The adrenoleukodystrophy protein (ALDP) and the 70-kDa peroxisomal membrane protein (PMP70) are half ATP-binding cassette (ABC) transporters in the human peroxisome membrane. ALDP and PMP70 share sequence homology and both are implicated in genetic diseases. PXA1 and YKL741 are Saccharomyces cerevisiae genes that encode homologs of ALDP and PMP70. Pxa1p, a putative ortholog of ALDP, is involved in peroxisomal beta-oxidation of fatty acids while YKL741 is an open reading frame found by the yeast genome sequencing project. Here we designate YKL741 as PXA2 and show that its protein product, Pxa2p, like Pxa1p, is associated with peroxisomes but not required for their assembly. Yeast strains carrying gene disruption of PXA1, PXA2, or both have similar and, in the case of the latter, nonadditive phenotypes. We also find that the stability of Pxa1p, but not Pxa2p, is markedly reduced in the absence of the other. Finally, we find that Pxa1p and Pxa2p coimmuno-precipitate. These genetic and physical data suggest that Pxa1p and Pxa2p heterodimerize to form a complete peroxisomal ABC transporter involved in fatty acid beta-oxidation. This result predicts the presence of similar heterodimeric ABC transporters in the mammalian peroxisome membrane.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Millions of people worldwide suffer from nutritional imbalances of essential metals like zinc. These same metals, along with pollutants like cadmium and lead, contaminate soils at many sites around the world. In addition to posing a threat to human health, these metals can poison plants, livestock, and wildlife. Deciphering how metals are absorbed, transported, and incorporated as protein cofactors may help solve both of these problems. For example, edible plants could be engineered to serve as better dietary sources of metal nutrients, and other plant species could be tailored to remove metal ions from contaminated soils. We report here the cloning of the first zinc transporter genes from plants, the ZIP1, ZIP2, and ZIP3 genes of Arabidopsis thaliana. Expression in yeast of these closely related genes confers zinc uptake activities. In the plant, ZIP1 and ZIP3 are expressed in roots in response to zinc deficiency, suggesting that they transport zinc from the soil into the plant. Although expression of ZIP2 has not been detected, a fourth related Arabidopsis gene identified by genome sequencing, ZIP4, is induced in both shoots and roots of zinc-limited plants. Thus, ZIP4 may transport zinc intracellularly or between plant tissues. These ZIP proteins define a family of metal ion transporters that are found in plants, protozoa, fungi, invertebrates, and vertebrates, making it now possible to address questions of metal ion accumulation and homeostasis in diverse organisms.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Bacillus subtilis strain ATCC6633 has been identified as a producer of mycosubtilin, a potent antifungal peptide antibiotic. Mycosubtilin, which belongs to the iturin family of lipopeptide antibiotics, is characterized by a β-amino fatty acid moiety linked to the circular heptapeptide Asn-Tyr-Asn-Gln-Pro-Ser-Asn, with the second, third, and sixth position present in the D-configuration. The gene cluster from B. subtilis ATCC6633 specifying the biosynthesis of mycosubtilin was identified. The putative operon spans 38 kb and consists of four ORFs, designated fenF, mycA, mycB, and mycC, with strong homologies to the family of peptide synthetases. Biochemical characterization showed that MycB specifically adenylates tyrosine, as expected for mycosubtilin synthetase, and insertional mutagenesis of the operon resulted in a mycosubtilin-negative phenotype. The mycosubtilin synthetase reveals features unique for peptide synthetases as well as for fatty acid synthases: (i) The mycosubtilin synthase subunit A (MycA) combines functional domains derived from peptide synthetases, amino transferases, and fatty acid synthases. MycA represents the first example of a natural hybrid between these enzyme families. (ii) The organization of the synthetase subunits deviates from that commonly found in peptide synthetases. On the basis of the described characteristics of the mycosubtilin synthetase, we present a model for the biosynthesis of iturin lipopeptide antibiotics. Comparison of the sequences flanking the mycosubtilin operon of B. subtilis ATCC6633, with the complete genome sequence of B. subtilis strain 168 indicates that the fengycin and mycosubtilin lipopeptide synthetase operons are exchanged between the two B. subtilis strains.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.